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Abstract 

An extensive body of researches has favored the use of 
regression over other parametric analyses that are based on OVA. 
In case of noteworthy regression results, researchers tend to 
explore magnitude of beta weights for the respective predictors. 
This is plausible when the predictor variables are perfectly 
uncorrelated. However, as other researchers suggested, the 
failure to compute structure coefficients may cause erroneous 
interpretation in regression, when multicollinearity is present 
between predictors. This paper provides an explanation by using 
two heuristic examples, so as to emphasize the importance of 
balancing attention to both beta weights and structure 
coefficients, so that valid interpretations of regression 
results are formulated. 
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A Comparison Between The Use of Beta Weights and Structure 
Coefficients In Interpreting Regression Results 
An extensive body of researches has favored the use of 
regression over other parametric analyses that are based on OVA, 
when the objective of the research project is to explore the 
relationship between a set of independent variables and one 
dependent/outcome/criterion variable (Cohen, 1968; Courville & 
Thomspon, 2001; Friedrich, 1991; Pedhazur, 1997; Thompson, 1992; 
Thompson & Borrello, 1985; Wilson, 1980) . The advantage of 
regression lies in that it can handle both intervally and 
nominally scaled predictor variables, whether or not they are 
correlated. Furthermore, the univariate feature makes the 
results straightforward and explicit by taking the following 
mathematical model: 

Y < Y = a + b l X l +b 2 X 2 +... + b j X j . ( 1 ) 

Y is the predicted value of the criterion upon given 
predictor X (s) . Researchers have argued that all statistical 
analyses are correlational and generate additive and 
multiplicative weights applied to measured or observed variables 
that can be used to estimate the scores on latent or synthetic 
variables (Fan, 1996; Thompson, 1991) . Supposed in a simple 
regression case where there is a single predictor, the model can 



be simplified into: 
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7 < Y = a + bX . (2) 

In Equation 2, the additive weight a (sometimes referred to 
as the 7-intercept of the fit line) serves to help control the 

mean of 7 to be equal to the mean of 7; while the 
multiplicative weight b (usually termed as unstandardized 
regression coefficient or slope of the regression) functions to 
also convert the " ' spreadoutness ' of the predictor variable into 
the same metric as the 'spreadoutness' of the dependent 
variable" (Perry, 1990, p. 4) . In terms of regression, the 
primary interest in any research practice is to minimize the 
distance (i.e. e score) between the observed score Y and the 

predicted score on 7 so as to yield a perfect prediction for 
each case by weighting the predictor variables. Therefore, in a 
regression model, there are at least two measured variables ( X 

and 7 ) , and always two latent variables ( 7 and e ) . Because b 
weights are sensitive to the existence of different scaling in 
predictor variables, and the variance of measured variables, 
they are usually transformed into standardized coefficients, the 
beta weights, by the following computation: 

P = b(SD x / SD Y ) . (3) 

By doing so, the influence of the variance of the measured 
variables has been washed out. As a result, the magnitude of the 
predictor' s beta weight can be employed to identify the 
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contribution of that variable in the prediction; at least from 
one perspective. 

Uncorrelated Predictors: Heuristic Example #1 
In case when there are noteworthy results occurring, 
researchers tend to explore the source of the effect size by 
emphasizing /? weights of the respective predictors (Courville & 
Thompson, 2001) . This is reasonable in simple regression or in 
multiple regression (MR) with two or more predictors, all of 
which are completely uncorrelated with each other; but is 
reasonable only in these cases. 



INSERT TABLE 1 ABOUT HERE. 



As reflected in Table 1, the biviarate correlation between 
X x and X 2 is 0, which indicates that each of the predictor 
variable shares a non-overlapping area in the variance of the Y 
scores. According to formula (4) (which is always applicable for 
two predictor variables) , the beta weights of X x and X 2 equal to 
their respective Pearson r with the outcome variable (and thus 
ranging from -1 to +1) : 



A = 



v yx x \ r x,x 2 ) 



1 - rz 



A = 



r YX 2 [ r YX, \ r X t X 2 ) 



1 — r 



x,x 2 



( 4 ) 



Since 
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A — r YX, ' Pi ~ T YX 2 

Hence 

/?! = . 3 8 5 , /? 2 = .859 

Moreover, the effect size multiple i? 2 is .887, which 
corresponds to the following calculation: 

R Y(X l ,X 2 ) = /?! (r Y x l ) + Pi ( r YX 2 ) 

= r YX x +r YX 2 

= .385 2 +.859 2 
= .887 

It is evident that X x explains 14% of the criterion whereas 
X 2 contributes 75%. By examining the beta weights, we can claim 
that X 2 supercedes X 1 in accounting for a larger proportion of 
the Y information and thus does a better job of predicting score 

on the latent variable Y . This is illustrated by the Figure 1 
Venn diagram. 



INSERT FIGURE 1 ABOUT HERE. 

Multicollinearity 

However, it is fallacious (and erroneous) to believe that 
once the job of interpreting beta weight has been accomplished, 
the interpretation of the analysis will be free of suspicion. As 
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many other scholars have suggested, the beta weight is not the 
only issue that counts. In fact, structure coefficient (or the 
Pearson r between predictors and the dependent variable, both of 
which will yield identical results) is equally important in 
terms of interpretation, if not more (Cooley & Lohnes, 1971; 
Pedhazur, 1997; Thompson & Borrello, 1985) . When 
multicollinearity (or collinearity) is present between the 
predictors in multiple regression, both beta weights and 
structure coefficients must be interpreted. 

Considering the reality that not all predictors are highly 
uncorrelated or "researchers purposely introduce collinearity 
when using multiple measures of variables in which they have 
greater interest or which are more important from a theoretical 
point of view" (Thompson & Borrello, 1985, p. 204), the sole 
reliance on beta weights may create serious interpretation 
problems. As Pedhazur (1982) stated: 

It should be clear now that high multicollinearity may lead 
not only to serious distortions in the estimations of 
regression coefficients but also to reversals in their 
signs. Therefore, the presence of high collinearity poses 
serious threats to the interpretation of the regression 
coefficients as indices of effects, (p. 246) 
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Consequently, the advocacy for the interpretation of structure 
coefficient in regression research has emerged (Cohen, 1968; 
Cooley & Lohnes, 1971; Dunlap & Landis, 1998) . 

A structure coefficient is defined as the zero-order 
bivariate correlation between a given predictor variable X and 

the latent (or synthetic) variable Y . It should be noted that a 
structure coefficient differs from a Pearson r between a given 
predictor and the measured variable Y by the formula: 

r S = r YX / ^ 

Heuristic Example #2 

In presence of three predictor variables which are all 
correlated to each other, the interpretation of regression 
becomes complicated. 

As shown in Table 2, if we only center our attention on 
beta weights of the three predictors, we might assert that X 2 
has little or no effect on the outcome variable because /? 2 
equals to zero (or near zero depending on how many rounding 
decimals we chose) . Therefore, in this regression model, X 2 
contributes little in producing the effect size. Similarly, we 
might judge that X 3 is the best predictor among the three 

( /? 3 = . 880827 ) , followed by X x ( /3 X = . 117911) . However, when the 
attempt is to investigate the respective relationship of each X 
with Y , it is surprising to find that X 2 has a near perfect 
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relationship with the latent variable ( r s = . 9108653) , while X 1 
only has a moderate correlation (^=.57610974), which 
contradicts with the previous conclusion. 



INSERT TABLES 2 AND 3 ABOUT HERE. 

A further exploration of this special case reveals that X 2 
and X 3 are highly correlated with each other with a Pearson r 
of .946. The correlation matrix in Table 3 also demonstrates 
that the three predictors are actually 'combined' with each 
other (f Xi x 2 =-221; r xx - .483) . Hence, the sum of the r 2 of each 

predictor with Y will be much greater than the multiple R 
square (.890) . An explanation can be reached by consulting 
Figure 2 . 



INSERT FIGURE 2 ABOUT HERE. 

Obviously, X 2 and X 3 account for most of the common area, 
and the structure coefficient of X 3 is bigger than that of X 2 , 
which has simply arbitrarily not been granted credit to yield 
part of the effect size (the overlapping area with Y is 
circumscribed within that of X x and X 3 ) . However, this doesn't 
mean X 2 assumes no predictive impact in explaining the variance 
of Y . To the contrary, X 2 turns out to be a very qualified 
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predictor! In fact, when holding the other two predictors 
constant, X 2 serves as a predictor almost as significant as X 3 
( r s = . 9939596) , but more predictive than X l . 

This example illustrates that the sole reliance on beta 
weights could have caused serious interpretive problems. Because 
beta weights only work appropriately in cases when predictors 
are perfectly uncorrelated with each other, the computation of 
structure coefficients becomes a must in any multicollinearity 
situation when the research interest is to assess the predictive 
impact of each predictor. 

Summary 

Beta weights can be addressed to identify the merit of any 
predictor uncorrelated with other independent variables. On the 
other hand, structure coefficients can be applied to evaluate 
the relative predictive importance of a single predictor on the 

latent variable Y, to indicate how strongly each independent 
variable influences the criterion variable. Structure 
coefficients are not affected by multicollinearity. 

Even though some scholars have denied the use of structure 
coefficients in multiple regression (see Harris, 1992), Thompson 
(1992) proposed that "... thoughtful researcher should always 
interpret either (a) both the beta weights and the structure 
coefficients or (b) both the beta weights and the bivariate 
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correlations of the predictors with Y" (p. 14) . By consulting 
both beta weights and structure coefficients, researchers can 
report unbiased and valid regression results by balancing their 
attention to both interpretation perspectives. Chapters 8 and 9 
in Thompson (2006) provide more detail on these various issues. 
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Table 1 

Correlation Matrix for Example #1 





Y 


Xi 


x 2 


Y 








Xi 


0.385 






x 2 


0.859 


0.000 


1.000 



Note. R 2 =.877. 
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Table 2 



Beta Weights 


and Structure Coefficients 


of Example #2 


Predictors 


Standardized Coefficients 

P 


Structure Coefficients 

r s 


Xi 


0 .117911 


0.57610974 


x 2 


8 . 680040E-15 


0.9108653 


X 3 


0.880827 


0.99399596 
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Table 3 

Correlation Matrix for Example #2 





Y 


X, X 2 X 3 


Y 






Xi 


0.554 




X 2 


0.859 


0.221 


X 3 


0.938 


0.483 0.946 1.000 


Note. R 2 = 


. 89. 







Figure 2. Venn diagram for three correlated predictors. 



